Sentiment Analysis by Augmenting Expectation Maximisation with Lexical Knowledge

نویسندگان

  • Xiuzhen Zhang
  • Yun Zhou
  • James Bailey
  • Kotagiri Ramamohanarao
چکیده

Sentiment analysis of documents aims to characterise the positive or negative sentiment expressed in documents. It has been formulated as a supervised classification problem, which requires large numbers of labelled documents. Semi-supervised sentiment classification using limited documents or words labelled with sentiment-polarities are approaches to reducing labelling cost for effective learning. Expectation Maximisation (EM) has been widely used in semi-supervised sentiment classification. A prominent problem with existing EM-based approaches is that the objective function of EM may not conform to the intended classification task and thus can result in poor classification performance. In this paper we propose to augment EM with the lexical knowledge of opinion words to mitigate this problem. Extensive experiments on diverse domains show that our lexical EM algorithm achieves significantly higher accuracy than existing standard EM-based semi-supervised learning approaches for sentiment classification, and also significantly outperforms alternative approaches using the lexical knowledge.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SenticNet 5: Discovering Conceptual Primitives for Sentiment Analysis by Means of Context Embeddings

With the recent development of deep learning, research in AI has gained new vigor and prominence. While machine learning has succeeded in revitalizing many research fields, such as computer vision, speech recognition, and medical diagnosis, we are yet to witness impressive progress in natural language understanding. One of the reasons behind this unmatched expectation is that, while a bottom-up...

متن کامل

Self-training from labeled features for sentiment analysis

Sentiment analysis concerns about automatically identifying sentiment or opinion expressed in a given piece of text. Most prior work either use prior lexical knowledge defined as sentiment polarity of words or view the task as a text classification problem and rely on labeled corpora to train a sentiment classifier. While lexicon-based approaches do not adapt well to different domains, corpus-b...

متن کامل

Sentimantics: Conceptual Spaces for Lexical Sentiment Polarity Representation with Contextuality

Current sentiment analysis systems rely on static (context independent) sentiment lexica with proximity based fixed-point prior polarities. However, sentimentorientation changes with context and these lexical resources give no indication of which value to pick at what context. The general trend is to pick the highest one, but which that is may vary at context. To overcome the problems of the pr...

متن کامل

Mining the Sentiment Expectation of Nouns Using Bootstrapping Method

We propose an unsupervised bootstrapping method to generate a new type of affect knowledge base: the sentiment expectation of nouns (e.g., “high salary” is desirable while “high price” is usually undesirable, because people have opposite sentiment expectation towards “salary” and “price”). A bootstrapping framework is designed to retrieve patterns that might be used to express complaints from t...

متن کامل

Expectation Maximisation for Sensor Data Fusion

The expectation maximisation algorithm (EM) was introduced by Dempster, Laird and Rubin in 1977 [DLR77]. The basic of expextation maximisation is maximum likelihood estimation (MLE). In modern sensor data fusion expectation maximisation becomes a substantial part in several applications, e.g. multi target tracking with probabilistic multi hypothesis tracking (PMHT), target extraction within pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012